Abstract

For probability vectors $x$ and $y$, the catalytic majorization relation $x \prec_T y$ is defined to hold
when there exists a probability vector $z$ such that $x \otimes z$ is majorized by $y \otimes z$. In this paper, an
infinite family of functions is given such that, subject to some trivial restrictions, $x \prec_T y$ if and only
if $f_r(x) < f_r(y)$ for all functions $f_r$ in the family. An outline of a proof of this result is provided.
The catalytic majorization relation is known to determine which transformations
of jointly held pure quantum states are possible using local operations and classical communication
when an additional jointly held state may be specified to facilitate the transformation without being
consumed.
I. INTRODUCTION

Let $x = (x_1, \ldots, x_d)$ and $y = (y_1, \ldots, y_d)$ be $d$-dimensional vectors with real components.
Let $x^\downarrow$ denote the vector obtained by arranging the components of $x$ in decreasing order:
$x^\downarrow = (x^\downarrow_1, \ldots, x^\downarrow_d)$ where $x^\downarrow_1 \ge \cdots \ge x^\downarrow_d$. Then $x$ is said to be
majorized by $y$, written $x \prec y$, if

\[ \sum_{i=1}^{k} x^\downarrow_i \le \sum_{i=1}^{k} y^\downarrow_i \]

when $1 \le k < d$ and $\sum_{i=1}^{d} x_i = \sum_{i=1}^{d} y_i$. The majorization relation has been well studied;
useful literature on the subject includes [1] and [2].

For a $d$-dimensional vector $x$ and an $\ell$-dimensional vector $z$ the notation $x \otimes z$ denotes
the $d\ell$-dimensional vector tensor product, which is a vector whose components are all of
the terms of the form $x_i z_j$. For our purposes, the order of the components is irrelevant.
Our interest is in the following question: given two $d$-dimensional probability vectors $x$ and
$y$, does there exist a (finite-dimensional) probability vector $z$ such that $x \otimes z \prec y \otimes z$?
This question motivates the definition of the catalytic majorization relation: we say $x$ is
catalytically majorized by $y$, written $x \prec_T y$, when such a $z$ exists.$^1$ Probability vectors of
different dimensions can be compared with this relation by appending zeros to the shorter
vector, and in fact we find it convenient to consider catalytic majorization as a relation
among probability vectors of all finite dimensions.
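The two definitions above can be checked mechanically on small examples. The following is a minimal sketch in Python with exact rational arithmetic; the function names `is_majorized` and `tensor` and the particular vectors are illustrative assumptions, not taken from the text. It tests majorization through partial sums of the decreasingly ordered components, and then exhibits a pair for which $x$ is not majorized by $y$ although $x \otimes z$ is majorized by $y \otimes z$ for a two-component $z$.

```python
from fractions import Fraction as F
from itertools import accumulate

def is_majorized(x, y):
    """Return True if x is majorized by y (x and y of equal dimension)."""
    xs = sorted(x, reverse=True)
    ys = sorted(y, reverse=True)
    if len(xs) != len(ys) or sum(xs) != sum(ys):
        return False
    # Compare the partial sums of the decreasingly ordered components.
    return all(px <= py for px, py in zip(accumulate(xs), accumulate(ys)))

def tensor(x, z):
    """All products x_i * z_j; the order of the components is irrelevant."""
    return [xi * zj for xi in x for zj in z]

# Illustrative vectors (an assumption of this sketch).
x = [F(4, 10), F(4, 10), F(1, 10), F(1, 10)]
y = [F(1, 2), F(1, 4), F(1, 4), F(0)]
z = [F(6, 10), F(4, 10)]

print(is_majorized(x, y))                        # direct majorization
print(is_majorized(tensor(x, z), tensor(y, z)))  # with the candidate catalyst z
```

Exact fractions are used here so that ties between partial sums are not decided by floating-point rounding. Finding a suitable $z$ in general is the hard part, which this sketch does not attempt.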

In this paper, an infinite family of functions is given with the property that, subject to
some trivial restrictions, $x \prec_T y$ if and only if $f_r(x) < f_r(y)$ for all functions $f_r$ in the family.

This result provides an answer to open problem 4 of [6] (also posed in [5]), which asks to
determine relatively simple conditions to decide whether or not $x \prec_T y$. This result also
essentially resolves a conjecture of Nielsen (as stated in [4]) on that problem; with some
minor adjustments, that conjecture would hypothesize the present main theorem.

We regard the main mathematical significance of our result to be in clarifying the nature
of the catalytic majorization relation, as well as providing a formulation that may be more
useful mathematically than the definition. However, we also note that our result should
provide a means of determining whether $x \prec_T y$ for a given $x$ and $y$; the definition does not
suggest a general practical method for making this determination.

$^1$ The catalytic majorization relation is called the trumping relation in [3, 4, 5].

The catalytic majorization relation has its origins in the field of quantum information. In
recent years this field has developed rapidly, and much research has been directed toward
understanding what sort of manipulations of quantum-mechanically entangled states are
possible. This research has in part been motivated by applications of entanglement, including
quantum teleportation, quantum dense coding, quantum cryptography, and quantum
computation.

Both majorization and catalytic majorization have been shown to arise in the study of
transformations of entangled bipartite pure quantum states. Nielsen obtained the following
result [7]:

Theorem 1. Suppose Alice and Bob are in joint possession of an entangled pure quantum
state $|\psi_1\rangle$ that they wish to transform into another bipartite entangled pure state $|\psi_2\rangle$. Let
$|\psi_1\rangle = \sum_{i=1}^{d} \sqrt{\alpha_i}\,|i_A\rangle|i_B\rangle$ be a Schmidt decomposition of $|\psi_1\rangle$ and let
$|\psi_2\rangle = \sum_{i=1}^{d} \sqrt{\beta_i}\,|i'_A\rangle|i'_B\rangle$ be a Schmidt decomposition of $|\psi_2\rangle$. Then $|\psi_1\rangle$ can be
converted to $|\psi_2\rangle$ (with success guaranteed) using only local operations and classical
communication (LOCC) if and only if the vector $\alpha = (\alpha_1, \ldots, \alpha_d)$ is majorized by
$\beta = (\beta_1, \ldots, \beta_d)$.

Jonathan and Plenio [8] extended this result by showing that even if it is not possible to
convert $|\psi_1\rangle$ to $|\psi_2\rangle$ directly (according to Theorem 1), it may be possible to convert
$|\psi_1\rangle|\phi\rangle$ to $|\psi_2\rangle|\phi\rangle$, where $|\phi\rangle$ is an additional bipartite state shared by Alice and Bob.
If $x$, $y$, and $z$ are the vectors of (squared) Schmidt coefficients of $|\psi_1\rangle$, $|\psi_2\rangle$, and $|\phi\rangle$
respectively, then the (squared) Schmidt coefficients of $|\psi_1\rangle|\phi\rangle$ are the components of $x \otimes z$
and the (squared) Schmidt coefficients of $|\psi_2\rangle|\phi\rangle$ are the components of $y \otimes z$; thus Nielsen's
Theorem implies that $|\psi_1\rangle|\phi\rangle$ can be converted to $|\psi_2\rangle|\phi\rangle$ when $x \otimes z \prec y \otimes z$. The state
$|\phi\rangle$ is not consumed by this transformation; for this reason $|\phi\rangle$ is referred to as a catalyst
and we say $|\phi\rangle$ catalyzes the transformation from $|\psi_1\rangle$ to $|\psi_2\rangle$. A state exists that can
catalyze the transformation from $|\psi_1\rangle$ to $|\psi_2\rangle$ if and only if $x \prec_T y$.
II. MAIN RESULT

To state our main theorem we specify a family of functions, indexed by a real number $r$.
For a $d$-dimensional probability vector $x$, let

\[
f_r(x) =
\begin{cases}
\ln \sum_{i=1}^{d} x_i^r & (r > 1); \\
\sum_{i=1}^{d} x_i \ln x_i & (r = 1); \\
-\ln \sum_{i=1}^{d} x_i^r & (0 < r < 1); \\
-\sum_{i=1}^{d} \ln x_i & (r = 0); \\
\ln \sum_{i=1}^{d} x_i^r & (r < 0).
\end{cases}
\]

If any of the components of $x$ are 0, we take $f_r(x) = \infty$ for $r \le 0$.
Our main theorem is the following:

Theorem 2. Let $x = (x_1, \ldots, x_d)$ and $y = (y_1, \ldots, y_d)$ be $d$-dimensional probability vectors.
Suppose that $x$ and $y$ do not both contain components equal to 0 and that $x^\downarrow \ne y^\downarrow$. Then
$x \prec_T y$ if and only if $f_r(x) < f_r(y)$ for all $r \in \mathbb{R}$.

We refer to the inequalities $f_r(x) < f_r(y)$ as the $f_r$ inequalities. Note that the restrictions
on $x$ and $y$ in Theorem 2 do not limit the scope of the theorem in any essential way; for
example, it is clear that adding or removing components that are 0 from $x$ and $y$ does not
affect the status of the catalytic majorization relation between them.

We previously reported the statement of Theorem 2 in [9].

It should be clear that there are many equivalent choices for the $f_r$ functions, since for any
particular $r$ the composition of any increasing function with $f_r$ will give the same inequality
in Theorem 2. For example, $\sum_{i=1}^{d} x_i^r$ could be used in place of $\ln \sum_{i=1}^{d} x_i^r$ when $r > 1$. With
our definition, for $r > 0$ the function $f_r$ is, up to a positive constant factor that depends on
$r$, the negative of the Rényi entropy $H_r$. In particular, the function $f_1$ is the negative of the
Shannon entropy.
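As an illustration of how the $f_r$ inequalities might be examined numerically, the following Python sketch evaluates $f_r$ directly from the definition and tests $f_r(x) < f_r(y)$ on a finite grid of $r$ values. The function names, the grid, and the example vectors are assumptions made for this sketch. A finite grid can only suggest, not prove, that the inequalities hold for all real $r$.

```python
import math

def f_r(x, r):
    """The function f_r of a probability vector x (a list of floats)."""
    if r <= 0 and any(xi == 0 for xi in x):
        return math.inf                      # convention for zero components
    if r == 0:
        return -sum(math.log(xi) for xi in x)
    if r == 1:
        return sum(xi * math.log(xi) for xi in x if xi > 0)
    power_sum = sum(xi ** r for xi in x if xi > 0)
    return -math.log(power_sum) if 0 < r < 1 else math.log(power_sum)

def f_r_inequalities_hold(x, y, r_values):
    """Check f_r(x) < f_r(y) on a finite grid of r (a heuristic, not a proof)."""
    return all(f_r(x, r) < f_r(y, r) for r in r_values)

# Illustrative vectors (assumed); y has a zero component and x does not.
x = [0.4, 0.4, 0.1, 0.1]
y = [0.5, 0.25, 0.25, 0.0]
grid = [k / 4 for k in range(-40, 41)]       # r = -10, -9.75, ..., 10
print(f_r_inequalities_hold(x, y, grid))
```

For $r \le 0$ the comparison in this example is decided by the convention $f_r(y) = \infty$, since only $y$ contains a zero component.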

III. RELATED RESULTS IN THE LITERATURE 

There are many related results in the literature; we attempt to list the most relevant 
here. 
Recently, Aubrun and Nechita [10] obtained a result that has many aspects of our
Theorem 2. To describe their result, it is convenient to consider catalytic majorization to be
a relation among infinite-dimensional probability vectors with a finite number of nonzero
components (i.e., with finite support). Aubrun and Nechita considered the set $T_c(y)$, defined
to be the set of all $x$ such that $x \prec_T y$. In terms of the $f_r$ functions, their main result implies
that $x \in \overline{T_c(y)}$ if and only if $f_r(x) \le f_r(y)$ for $r \ge 1$, where $\overline{T_c(y)}$ is defined to be the $\ell_1$
closure of $T_c(y)$ within the space of infinite probability vectors with finite support. Stated
another way, their result is that if $x$ and $y$ are infinite probability vectors with finite support,
then the following are equivalent:

(i) For any $\epsilon > 0$ there exists an $x'$ with finite support such that $\|x - x'\|_1 < \epsilon$ and $x' \prec_T y$.

(ii) The inequalities $f_r(x) \le f_r(y)$ hold for $r \ge 1$.

We point out a subtlety that arises when comparing this result to ours. Let $P_d$ be the set
of all infinite probability vectors with at most $d$ nonzero components, and let
$T(y) = \{x \in P_d : x \prec_T y\}$. Then, as noted in [10], even though $T(y) = T_c(y) \cap P_d$, the $\ell_1$ closure
$\overline{T(y)}$ is generally a strict subset of $\overline{T_c(y)} \cap P_d$. Essentially, for some $d$-dimensional vectors $x$, one
cannot find an $x'$ that is "close" to $x$ for which $x' \prec_T y$ unless $x'$ can have more nonzero
components than $x$. Alternatively, this can be regarded as an indication of the importance
of the $f_r$ inequalities with $r < 1$ for characterizing $T(y)$.

The result of Aubrun and Nechita that we have stated above can be obtained as a
consequence of our Theorem 2. However, their main result also applies to multiple-copy
transformations, which are not addressed by the present work.

Transformations of states that use catalyst states as described in Section I are said to be
transformations that use entanglement-assisted LOCC (ELOCC). Thus Theorem 2 can be
regarded as a characterization of transformations that are possible under ELOCC.

If for some positive integer $k$, the state $|\psi_1\rangle^{\otimes k}$ (i.e., a state that is $k$ copies of $|\psi_1\rangle$) can be
transformed to $|\psi_2\rangle^{\otimes k}$ under LOCC, then we say that the state $|\psi_1\rangle$ can be transformed to
$|\psi_2\rangle$ under multiple-copy LOCC (MLOCC), and the transformation is called a multiple-copy
transformation. It was observed in [11] that MLOCC allows transformations that are not
possible under LOCC. Duan et al. [12] showed that any transformation that is possible
under MLOCC is also possible under ELOCC. Additional results on MLOCC and its close
relation to ELOCC can be found in [12, 13]. The main result of Aubrun and Nechita [10]
implies a characterization of MLOCC that is identical to their characterization of ELOCC.



Vidal [14] showed that Nielsen's theorem ([7], stated above as Theorem 1) can be
generalized to provide the optimum probability of transforming a state $|\psi_1\rangle$ to another state $|\psi_2\rangle$
under LOCC. (Nielsen's theorem covers the case where the transformation can be made
with certainty.) It was noted by Jonathan and Plenio [8] that catalyst states can be useful
in probabilistic entanglement transformations. Properties of catalyst-assisted probabilistic
entanglement transformations were investigated in [4, 15].



We do not consider MLOCC or probabilistic entanglement transformations in this paper. 



IV. ADDITIVE SCHUR-CONVEX FUNCTIONS 



A useful tool in the study of majorization is the notion of a Schur-convex function. A
function $f : \mathbb{R}^d \to \mathbb{R}$ is Schur-convex if $f(x) \le f(y)$ whenever $x \prec y$ [1]. Nielsen [5] has
introduced the notion of an additive Schur-convex function: A function $f$ from probability
vectors (of any dimension) to $\mathbb{R}$ is additive Schur-convex if $f$ is Schur-convex (when restricted
to probability vectors of a given dimension), and $f(x \otimes y) = f(x) + f(y)$. If $x \prec_T y$ then
we must have $f(x) \le f(y)$ for any additive Schur-convex function $f$, since if $x \otimes z \prec y \otimes z$
then $f(x) + f(z) = f(x \otimes z) \le f(y \otimes z) = f(y) + f(z)$. Clearly, this observation is a
motivating factor for the conjecture of Nielsen (mentioned in Section I) regarding conditions
for catalytic majorization.

Our functions $f_r$ are known to be additive Schur-convex functions. This can be verified
with the aid of the following fact (from, e.g., [1]): A differentiable function $f(x)$ is
Schur-convex if and only if it is invariant to permutations of the components of $x$ and

\[ (x_i - x_j)\left(\frac{\partial f}{\partial x_i} - \frac{\partial f}{\partial x_j}\right) \ge 0 \]

for any pair of indices $i, j$. (For our application, the definition of additive Schur-convexity
must be modified slightly to allow for the fact that $f_r$ can be $\infty$ when $r \le 0$, but this does
not cause much difficulty.)
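The additivity property $f(x \otimes z) = f(x) + f(z)$ is easy to confirm numerically for individual members of the family. The sketch below does so for $r = 1$ and $r = 2$ only; the helper names `f1`, `f2`, and `tensor`, and the vectors used, are assumptions of this illustration.

```python
import math

def f2(x):
    """f_r with r = 2: the logarithm of the sum of squares."""
    return math.log(sum(xi ** 2 for xi in x))

def f1(x):
    """f_r with r = 1: the negative of the Shannon entropy (natural log)."""
    return sum(xi * math.log(xi) for xi in x if xi > 0)

def tensor(x, z):
    """All products x_i * z_j."""
    return [xi * zj for xi in x for zj in z]

x = [0.5, 0.3, 0.2]   # illustrative probability vectors (assumed)
z = [0.7, 0.3]
for f in (f1, f2):
    # The two printed values should agree up to rounding.
    print(f.__name__, f(tensor(x, z)), f(x) + f(z))
```

For $r = 1$ the additivity relies on the components of $x$ and $z$ each summing to 1, which is why the property is stated for probability vectors.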

Clearly we have that $f_r(x)$ is continuous in $r$ when $r \ne 0, 1$. However we are really more
interested in the inequality $f_r(x) < f_r(y)$ than the actual values of the functions. It turns
out that this inequality is continuous in $r$ in a certain sense for all $r \in \mathbb{R}$. Specifically, if for
any neighborhood of $r$ there exists an $r'$ in the neighborhood such that $f_{r'}(x) \le f_{r'}(y)$, then
$f_r(x) \le f_r(y)$.

This continuity of the $f_r$ inequalities in $r$ can be exhibited as follows. Let
$\tilde{f}_r(x) = \frac{1}{r(r-1)} \ln \sum_{i=1}^{d} x_i^r$ for $r \ne 0, 1$. Clearly $\tilde{f}_r(x) < \tilde{f}_r(y)$ is equivalent to $f_r(x) < f_r(y)$
for $r \ne 0, 1$. For the case $r = 1$, we note that the limit of $\tilde{f}_r(x)$ as $r$ approaches 1 is
equal to $f_1(x)$. On the other hand, in the limit as $r \to 0$, $\tilde{f}_r(x)$ goes to $\pm\infty$ (depending
on which direction the limit is from). However, the difference $\tilde{f}_r(y) - \tilde{f}_r(x)$ converges to
$(f_0(y) - f_0(x))/d$ if neither $x$ nor $y$ contains components equal to zero. Thus, if we use the
limiting values when $r = 0, 1$ then the difference $\tilde{f}_r(y) - \tilde{f}_r(x)$ is continuous in $r$ over all of
$\mathbb{R}$ when neither $x$ nor $y$ contains components equal to zero. Given this fact and our main
theorem, it may be convenient in some cases to assess whether $x \prec_T y$ by examining a plot
of $\tilde{f}_r(y) - \tilde{f}_r(x)$ as a function of $r$.
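The rescaled difference described above can be tabulated (or plotted) with a few lines of Python. The following sketch assumes the rescaling $\frac{1}{r(r-1)} \ln \sum_i x_i^r$ and fills in the values at $r = 0$ and $r = 1$ with the limits stated in the text; the names `f_tilde` and `difference` and the example vectors are hypothetical.

```python
import math

def f_tilde(x, r):
    """Rescaled function (1 / (r (r - 1))) * ln(sum_i x_i^r), for r not in {0, 1}."""
    return math.log(sum(xi ** r for xi in x)) / (r * (r - 1))

def difference(x, y, r):
    """Rescaled difference for y minus x, extended to r = 0 and r = 1 by its limits."""
    d = len(x)
    if r == 0:
        # Limit (f_0(y) - f_0(x)) / d, with f_0(v) = -sum_i ln v_i.
        return (sum(math.log(xi) for xi in x) - sum(math.log(yi) for yi in y)) / d
    if r == 1:
        # Limit f_1(y) - f_1(x), with f_1(v) = sum_i v_i ln v_i.
        return (sum(yi * math.log(yi) for yi in y)
                - sum(xi * math.log(xi) for xi in x))
    return f_tilde(y, r) - f_tilde(x, r)

# Illustrative vectors with no zero components (assumed for this sketch).
x = [0.4, 0.3, 0.2, 0.1]
y = [0.5, 0.3, 0.15, 0.05]
for r in (-1e-6, 0.0, 1e-6, 1 - 1e-6, 1.0, 1 + 1e-6):
    print(r, difference(x, y, r))
```

The printed values on either side of $r = 0$ and $r = 1$ should be close to the values at those points, which is the continuity property being illustrated. Very close to $r = 0$ the two rescaled terms are individually large, so some loss of floating-point precision is to be expected.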



There are other additive Schur-convex functions that are not members of our $f_r$ family
of functions; examples are given in, e.g., [5]. We note in particular the functions

\[ x \mapsto \ln x^\downarrow_1, \]
\[ x \mapsto -\ln x^\downarrow_d, \]

and

\[ x \mapsto -\ln |\mathrm{supp}\, x|, \]

where $|\mathrm{supp}\, x|$ is the number of nonzero components of $x$. From our main theorem and the
above discussion, it must be the case that if for some $x$ and $y$ we have $f_r(x) < f_r(y)$ for
all $r \in \mathbb{R}$, then $f(x) \le f(y)$ for any additive Schur-convex function $f$. This can be easily
verified directly for the examples above.



V. ORDER-FREE CHARACTERIZATION OF MAJORIZATION 

The usual definition of majorization involves arranging the components of the vectors in
decreasing order. This definition appears to be inconvenient to work with when considering
catalytic majorization. Thus we rely on the following well-known equivalence (see, e.g., [2]),
where the notation $(c)_+$ means the positive portion of $c$; that is, $(c)_+ = \max(c, 0)$.

Proposition 3. For $d$-dimensional vectors $x$ and $y$ the following are equivalent:

(i) $x \prec y$;

(ii) $\sum_{i=1}^{d} x_i = \sum_{i=1}^{d} y_i$ and for all $t \in \mathbb{R}$,

\[ \sum_{i=1}^{d} (x_i - t)_+ \le \sum_{i=1}^{d} (y_i - t)_+; \]

(iii) $\sum_{i=1}^{d} x_i = \sum_{i=1}^{d} y_i$ and for all $t \in \mathbb{R}$,

\[ \sum_{i=1}^{d} (t - x_i)_+ \le \sum_{i=1}^{d} (t - y_i)_+. \]
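The order-free condition, in the form $\sum_i (x_i - t)_+ \le \sum_i (y_i - t)_+$ with equal totals, lends itself to a test that never sorts the vectors. The sketch below is one possible realization, not a method taken from the text. It relies on the observation that both sides are piecewise linear in $t$ with breakpoints at the components of $x$ and $y$, and agree outside the range of the breakpoints when the totals agree, so only the breakpoints need to be tested. The function names are hypothetical.

```python
from fractions import Fraction as F

def pos(c):
    """Positive part (c)_+ = max(c, 0)."""
    return c if c > 0 else 0

def is_majorized_order_free(x, y):
    """Test whether x is majorized by y without sorting either vector.

    Checks sum_i (x_i - t)_+ <= sum_i (y_i - t)_+ at every breakpoint t,
    i.e., at every component of x or y, after checking that the totals agree.
    """
    if sum(x) != sum(y):
        return False
    breakpoints = set(x) | set(y)
    return all(sum(pos(xi - t) for xi in x) <= sum(pos(yi - t) for yi in y)
               for t in breakpoints)

# Illustrative vectors (assumed).
x = [F(1, 4), F(1, 2), F(1, 4)]
y = [F(1, 10), F(3, 5), F(3, 10)]
print(is_majorized_order_free(x, y), is_majorized_order_free(y, x))
```

The same breakpoint argument applies to tensor products, where the breakpoints are the products $x_i z_j$ and $y_i z_j$.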



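In the sufficiency portion of the proof of our main result, we use the following formulation.
If $x$ and $y$ are $d$-dimensional probability vectors and $z$ is an $\ell$-dimensional vector, then
$x \otimes z \prec y \otimes z$ if and only if for all $t \in \mathbb{R}$ we have

\[ \sum_{i=1}^{d} \sum_{j=1}^{\ell} (x_i z_j - t)_+ \le \sum_{i=1}^{d} \sum_{j=1}^{\ell} (y_i z_j - t)_+. \tag{1} \]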

VI. NECESSITY 

We first establish the necessity direction of our main theorem: under the hypothesis of
Theorem 2, $x \prec_T y$ implies $f_r(x) < f_r(y)$ for all $r \in \mathbb{R}$. This almost follows from the
discussion of Schur-convex functions in Section IV. However, to show that the inequalities
must all be strict, we use the following result of Daftuar and the author.

Corollary 4. Let $x = (x_1, \ldots, x_d)$ and $y = (y_1, \ldots, y_d)$ be $d$-dimensional probability vectors.
Let $T(y)$ denote the set of all $d$-dimensional probability vectors $x'$ such that $x' \prec_T y$. Suppose
that $x \prec_T y$ and $y_1 > x_1$ and $y_d < x_d$. Then $x$ is in the interior of $T(y)$ (relative to the
space of probability vectors).

Sketch of proof of necessity portion of Theorem 2. Suppose $x \prec_T y$. Because each $f_r$ is
additive Schur-convex, we must have $f_r(x) \le f_r(y)$ for all $r \in \mathbb{R}$. However, we must show
that equality cannot hold. We assume that the components of both $x$ and $y$ are in decreasing
order. Without loss of generality, we may assume that no component of $x$ is equal to any
component of $y$, as removing such a component from both $x$ and $y$ does not change the status
of the catalytic majorization relation between them, nor does it affect the $f_r$ inequalities.




Using this assumption and the fact that $f_r(x) \le f_r(y)$ as $r$ approaches $+\infty$ and $-\infty$, we find
that we may assume $x_1 < y_1$ and $x_d > y_d$. Let $w$ be the $d$-dimensional vector $(1, 0, \ldots, 0, -1)$.
From Corollary 4, $x$ is not on the boundary of $T(y)$, so for a sufficiently small $\epsilon > 0$, we
have $x + \epsilon w \prec_T y$. It is straightforward to verify that $f_r(x) < f_r(x + \epsilon w)$ with this $\epsilon$ for all
$r \in \mathbb{R}$. But $f_r(x + \epsilon w) \le f_r(y)$ since $x + \epsilon w \prec_T y$, so necessity is established. ∎

VII. SUFFICIENCY OUTLINE 

We now outline how the sufficiency portion of Theorem 2 can be proved. That is, under
the hypothesis of Theorem 2, we outline how to show that if $f_r(x) < f_r(y)$ for all $r \in \mathbb{R}$ then
there exists a catalyst vector $z$ such that $x \otimes z \prec y \otimes z$.

For convenience, we assume that the components of $x$ and $y$ are arranged in decreasing
order. As in the necessity proof, we may assume without loss of generality that $x_1 < y_1$ and
$x_d > y_d$. We allow catalyst vectors to be unnormalized.

We argue that we need only consider vectors $x$ and $y$ whose components are all nonzero.
The constraint $x_d > y_d$ implies that the components of $x$ are all nonzero. For $y$, we use the
following claim.

Claim. If $x$ and $y$ satisfy the $f_r$ inequalities, and $x_1 < y_1$ and $x_d > y_d$, then there exists a
$y'$ (with components assumed to be in decreasing order) whose components are all nonzero,
with $x_1 < y'_1$ and $x_d > y'_d$, such that $y' \prec y$, and the $f_r$ inequalities are satisfied with $y$
replaced by $y'$.

This claim is straightforward (but slightly tedious) to verify; we provide a proof in the
Appendix. Suppose the sufficiency portion of Theorem 2 is shown to hold for probability
vectors whose components are all nonzero. Then given $x$ and $y$ satisfying the $f_r$ inequalities,
we can find $y'$ with the properties stated in the claim and conclude that there exists a $z$
such that $x \otimes z \prec y' \otimes z$. But $y' \prec y$ implies $y' \otimes z \prec y \otimes z$, so we have $x \otimes z \prec y \otimes z$ as
desired. Thus we henceforth assume that all components of $y$ (and of $x$) are nonzero.

The sufficiency proof can be divided into steps as follows. In these steps, we assume $x$
and $y$ are $d$-dimensional probability vectors with all nonzero components, the components
of $x$ and $y$ are arranged in decreasing order, and $x_1 < y_1$ and $x_d > y_d$.

Step 1. Show that if the $f_r$ inequalities hold for all $r \ge 0$, then there exists a continuous,
decreasing function $z_+ : [0, \infty) \to (0, 1]$ with $z_+(0) = 1$, along with a constant $s_+$, such that

(i) for all $t \in (0, y_1)$,

\[ \sum_{i=1}^{d} \int_0^\infty (x_i z_+(s) - t)_+ \, ds < \sum_{i=1}^{d} \int_0^\infty (y_i z_+(s) - t)_+ \, ds; \tag{2} \]

(ii) if $s \ge s_+$ then $z_+(s) = z_+(s_+) e^{-(s - s_+)}$; and

(iii) the function $z_+$ satisfies a Lipschitz condition; that is, there exists a $K$ such that if
$s_1, s_2 \ge 0$ then $|z_+(s_1) - z_+(s_2)| \le K |s_1 - s_2|$.

Step 2. Show that if the $f_r$ inequalities hold for all $r \le 0$, then there exists a continuous,
increasing function $z_- : [0, \infty) \to [1, \infty)$ with $z_-(0) = 1$, along with a constant $s_-$, such that

(i) for all $t \in (y_d, \infty)$,

\[ \sum_{i=1}^{d} \int_0^\infty (t - x_i z_-(s))_+ \, ds < \sum_{i=1}^{d} \int_0^\infty (t - y_i z_-(s))_+ \, ds; \]

(ii) if $s \ge s_-$ then $z_-(s) = z_-(s_-) e^{s - s_-}$; and

(iii) the function $z_-$ satisfies a Lipschitz condition on the interval $[0, s_-]$.

Step 3. Show that if $z_+$ and $z_-$ satisfy the conditions given in Steps 1 and 2, then there
exists a continuous, decreasing, positive function $z_*(s)$ on an interval $[0, a]$, with $z_*(0) = 1$,
such that

(i) for all $t \in (y_d z_*(a), y_1)$,

\[ \sum_{i=1}^{d} \int_0^a (x_i z_*(s) - t)_+ \, ds < \sum_{i=1}^{d} \int_0^a (y_i z_*(s) - t)_+ \, ds; \tag{3} \]

and

(ii) the function $z_*$ satisfies a Lipschitz condition on $[0, a]$.

Step 4. Show that if $z_*$ satisfies the conditions given in Step 3, then there exists a
finite-dimensional $z$ for which $x \otimes z \prec y \otimes z$.

Clearly, the results to be proved in Steps 1 through 4 together imply the sufficiency
portion of Theorem 2.
VIII. OVERVIEW OF STEPS 1 AND 2



Here we present an outline of how Steps 1 and 2 can be completed. The presentation 
here is somewhat less detailed than that of the other portions of the proof, but we hope 
that this outline provides a reasonable indication of our general strategy and of the methods 
involved. 

First we provide some motivation for why the results in these steps are useful. The
basic idea is that conditions (i) of Steps 1 and 2 are reminiscent of the order-free conditions
for majorization discussed in Section V. In fact one could regard condition (i) of Step 1
as implying that $x$ is catalytically majorized by $y$ with catalyst $z_+$, in an appropriately
generalized sense. However, we note that this condition by itself does not imply that $x \prec_T y$.
(A similar observation is made in [4, 10].) Conditions (ii) of Steps 1 and 2 together provide
a way to combine the results of Steps 1 and 2, as will be clear in carrying out Step 3. We
provide further thoughts related to conditions (i) of Steps 1 and 2 in Section XII.

In Step 1, we first find a function $\hat{z}_+$ that satisfies condition (i) without regard for
condition (ii). A key idea is to consider functions of the form

\[ \hat{z}_+(s) = e^{-s^{1/n}}, \]

where $n$ is a positive integer. It turns out that if for a given $x$ and $y$ the $f_r$ inequalities are
satisfied for $r \ge 0$, then for sufficiently large $n$ condition (i) holds with this $\hat{z}_+$ substituted
for $z_+$.

To establish this result, we parameterize $t$ as $t = e^{-n/r}$; the range $r \in (0, \infty)$ corresponds
to the range $t \in (0, 1)$. For a given $r$ there is a correspondence between the inequality (2)
for $t = e^{-n/r}$ and the $f_r$ inequality. Specifically, the following holds: If for some $r > 0$ the $f_r$
inequality is satisfied, then for all sufficiently large $n$ the inequality (2) holds for $t = e^{-n/r}$
and $z_+(s) = e^{-s^{1/n}}$. In other words, given $r > 0$, if $f_r(x) < f_r(y)$ then for all sufficiently
large $n$ we have

\[ \sum_{i=1}^{d} \int_0^\infty \left( x_i e^{-s^{1/n}} - e^{-n/r} \right)_+ ds < \sum_{i=1}^{d} \int_0^\infty \left( y_i e^{-s^{1/n}} - e^{-n/r} \right)_+ ds. \tag{4} \]

We require the existence of an $n$ such that (4) holds for all $r > 0$, so it is not enough just to
show that the above implication holds for each $r > 0$.
Our strategy in showing such an $n$ exists involves dividing the values of $r$ into five regions:
regions 1, 3, and 5 correspond conceptually to neighborhoods of 0, 1, and $\infty$, respectively,
while regions 2 and 4 fill in the gaps. More precisely, the steps in the strategy are as follows.

Region 5. Show that the inequality $y_1 > x_1$ implies that there is an interval $R_5 = [r_4, \infty)$
for some $r_4 \in (1, \infty)$, along with a positive integer $n_5$, such that if $r \in R_5$ and $n \ge n_5$, then
(4) holds.

Region 3. Show that the $f_r$ inequality with $r = 1$ implies that there is an interval
$R_3 = [r_2, r_3]$, where $r_2 \in (0, 1)$ and $r_3 \in (1, r_4)$, along with a positive integer $n_3$, such that if
$r \in R_3$ and $n \ge n_3$, then (4) holds.

Region 1. Show that the $f_r$ inequality with $r = 0$ implies that there is an interval
$R_1 = (0, r_1]$ for some $r_1 \in (0, r_2)$, such that if $r \in R_1$ and $n \ge 1$, then (4) holds. Note the
condition $n \ge 1$; unlike the other regions, $n$ does not need to be large here.

Region 2. The interval $R_2$ is $[r_1, r_2]$. The task for this step is to show that the $f_r$
inequalities for $r \in R_2$ imply that there exists a positive integer $n_2$ such that if $r \in R_2$ and
$n \ge n_2$, then (4) holds.

Region 4. Similarly, the interval $R_4$ is $[r_3, r_4]$. The task for this step is to show that the
$f_r$ inequalities for $r \in R_4$ imply that there exists a positive integer $n_4$ such that if $r \in R_4$
and $n \ge n_4$, then (4) holds.

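We proceed with some mathematical preliminaries that we found useful for completing
the region steps above. For convenience we let

\[ a(c, t, n) = \int_0^\infty \left( c\, e^{-s^{1/n}} - t \right)_+ ds, \]

where $c \in (0, 1)$ and $n \ge 1$ and $t > 0$.

We first observe that for $n \ge 1$ and $s \ge 0$,

\[ \int_0^s e^{-u^{1/n}} \, du = n! - n!\, e^{-s^{1/n}} \sum_{j=0}^{n-1} \frac{s^{j/n}}{j!}. \]

This can be verified by taking the derivative of the right side with respect to $s$.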
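Suppose $c > t$. Then

\begin{align*}
a(c, t, n) &= \int_0^{(\ln(c/t))^n} \left( c\, e^{-s^{1/n}} - t \right) ds \\
&= c\, n! - c\, n!\, \frac{t}{c} \sum_{j=0}^{n-1} \frac{1}{j!} \left( \ln\frac{c}{t} \right)^j - t \left( \ln\frac{c}{t} \right)^n \\
&= c\, n! - t\, n! \sum_{j=0}^{n} \frac{1}{j!} \left( \ln\frac{c}{t} \right)^j. \tag{5}
\end{align*}

Noting that $\sum_{j=0}^{\infty} \frac{1}{j!} \left( \ln\frac{c}{t} \right)^j = c/t$, it follows from (5) that

\[ a(c, t, n) = t\, n! \sum_{j=n+1}^{\infty} \frac{1}{j!} \left( \ln\frac{c}{t} \right)^j. \]

Substituting $t = e^{-n/r}$ gives

\[ a(c, e^{-n/r}, n) = e^{-n/r}\, n! \sum_{j=n+1}^{\infty} \frac{1}{j!} \left( \ln c + \frac{n}{r} \right)^j. \]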
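As a sanity check on the closed form $a(c,t,n) = c\,n! - t\,n! \sum_{j=0}^{n} (\ln(c/t))^j / j!$ for $c > t$ and integer $n \ge 1$, the following sketch compares it with a direct numerical estimate of $\int_0^\infty (c\,e^{-s^{1/n}} - t)_+\,ds$. The midpoint rule, the step count, the function names, and the parameter values are all choices made for this illustration.

```python
import math

def a_closed_form(c, t, n):
    """c*n! - t*n! * sum_{j=0}^{n} (ln(c/t))^j / j!, for c > t and integer n >= 1."""
    log_ratio = math.log(c / t)
    partial = sum(log_ratio ** j / math.factorial(j) for j in range(n + 1))
    return math.factorial(n) * (c - t * partial)

def a_numeric(c, t, n, steps=200_000):
    """Midpoint-rule estimate of the integral of (c*exp(-s^(1/n)) - t)_+ over s >= 0."""
    upper = math.log(c / t) ** n          # the integrand vanishes beyond this point
    h = upper / steps
    total = 0.0
    for k in range(steps):
        s = (k + 0.5) * h
        total += max(c * math.exp(-s ** (1.0 / n)) - t, 0.0)
    return total * h

c, t, n = 0.4, 0.05, 3                    # illustrative values (assumed), with c > t
print(a_closed_form(c, t, n), a_numeric(c, t, n))
```

The two printed numbers should agree to several digits. For large $n$ the closed form subtracts nearly equal quantities, so a series form of $a$ may be preferable numerically.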



We found this form of a to be a useful starting point for completing the region 5 step. 
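Returning to (5), we substitute $t = e^{-n/r}$ and find that

\begin{align*}
a(c, e^{-n/r}, n) &= c\, n! - e^{-n/r}\, n! \sum_{j=0}^{n} \frac{1}{j!} \left( \ln c + \frac{n}{r} \right)^j \\
&= c\, n! - e^{-n/r}\, n! \sum_{j=0}^{n} \frac{1}{j!} \sum_{k=0}^{j} \binom{j}{k} (\ln c)^k \left( \frac{n}{r} \right)^{j-k} \\
&= c\, n! - e^{-n/r}\, n! \sum_{k=0}^{n} (\ln c)^k \sum_{j=k}^{n} \frac{1}{j!} \binom{j}{k} \left( \frac{n}{r} \right)^{j-k} \\
&= c\, n! - e^{-n/r}\, n! \sum_{k=0}^{n} \frac{1}{k!} (\ln c)^k \sum_{j=k}^{n} \frac{1}{(j-k)!} \left( \frac{n}{r} \right)^{j-k} \\
&= c\, n! - e^{-n/r}\, n! \sum_{k=0}^{n} \frac{1}{k!} (\ln c)^k \sum_{j=0}^{n-k} \frac{1}{j!} \left( \frac{n}{r} \right)^{j}, \tag{6}
\end{align*}

where in the last step we have replaced $j - k$ with $j$. We found the form of $a$ given by (6)
to be a useful starting point for completing the region 1 and region 2 steps.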
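Now note that $\sum_{k=0}^{\infty} \frac{1}{k!} (\ln c)^k \sum_{j=0}^{\infty} \frac{1}{j!} \left( \frac{n}{r} \right)^j = c\, e^{n/r}$. Thus we can conclude from (6) that

\[ a(c, e^{-n/r}, n) = e^{-n/r}\, n! \sum_{k=0}^{\infty} \frac{1}{k!} (\ln c)^k \sum_{j=\max(n+1-k,\,0)}^{\infty} \frac{1}{j!} \left( \frac{n}{r} \right)^j. \tag{7} \]

We found this form of $a$ to be a useful starting point for completing the region 3 and region 4
steps.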

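As an example we discuss our approach for the region 4 step in some detail. We start
with (7) where we replace $j$ with $j + n + 1 - k$:

\begin{align*}
a(c, e^{-n/r}, n) &= e^{-n/r}\, n! \sum_{k=0}^{\infty} \frac{1}{k!} (\ln c)^k \sum_{j=\max(0,\,k-n-1)}^{\infty} \frac{1}{(n+1+j-k)!} \left( \frac{n}{r} \right)^{n+1+j-k} \\
&= e^{-n/r}\, n! \left( \frac{n}{r} \right)^{n+1} \sum_{k=0}^{\infty} \frac{1}{k!} (r \ln c)^k \sum_{j=\max(0,\,k-n-1)}^{\infty} \frac{n^{j-k}}{(n+1+j-k)!}\, r^{-j} \\
&= \frac{e^{-n/r}}{n+1} \left( \frac{n}{r} \right)^{n+1} \sum_{k=0}^{\infty} \frac{1}{k!} (\ln c^r)^k \sum_{j=\max(0,\,k-n-1)}^{\infty} \frac{(n+1)!\, n^{j-k}}{(n+1+j-k)!}\, r^{-j}. \tag{8}
\end{align*}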

We present a sequence of successively refined results regarding this expression. It is not
difficult to show that for fixed $j$ and $k$,

\[ \lim_{n \to \infty} \frac{(n+1)!\, n^{j-k}}{(n+1+j-k)!} = 1. \]
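The limit of the ratio $(n+1)!\,n^{j-k}/(n+1+j-k)!$ for fixed $j$ and $k$ can be observed numerically. The sketch below evaluates it through the log-gamma function to avoid overflow of the factorials; the function name `ratio` and the sample values of $n$, $j$, and $k$ are illustrative assumptions.

```python
import math

def ratio(n, j, k):
    """(n+1)! * n^(j-k) / (n+1+j-k)!, computed through log-gamma to avoid overflow."""
    m = j - k
    # lgamma(v + 1) equals ln(v!) for nonnegative integers v.
    return math.exp(math.lgamma(n + 2) + m * math.log(n) - math.lgamma(n + 2 + m))

for n in (10, 1_000, 100_000):
    print(n, ratio(n, j=3, k=1), ratio(n, j=1, k=3))
```

For $j - k = 2$ the ratio equals $n^2 / ((n+2)(n+3))$ and for $j - k = -2$ it equals $1 + 1/n$, so the printed values should approach 1 from below and from above respectively.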

By crudely bounding the difference between $\frac{(n+1)!\, n^{j-k}}{(n+1+j-k)!}$ and 1 for large $n$, it can be shown
that for fixed $r > 1$ and fixed $k$,

\[ \lim_{n \to \infty} \sum_{j=\max(0,\,k-n-1)}^{\infty} \frac{(n+1)!\, n^{j-k}}{(n+1+j-k)!}\, r^{-j} = \sum_{j=0}^{\infty} r^{-j} = \frac{r}{r-1}. \]

By instead carefully bounding the quantities involved in the double sum in (8) it can be
shown that for a fixed $c \in (0, 1)$ and a fixed $r > 1$,

\[ \lim_{n \to \infty} \sum_{k=0}^{\infty} \frac{1}{k!} (\ln c^r)^k \sum_{j=\max(0,\,k-n-1)}^{\infty} \frac{(n+1)!\, n^{j-k}}{(n+1+j-k)!}\, r^{-j} = \frac{r}{r-1} \sum_{k=0}^{\infty} \frac{1}{k!} (\ln c^r)^k = \frac{r}{r-1}\, c^r. \]

Continuing along these lines, it can be shown that for a fixed $r > 1$, if $\sum_{i=1}^{d} x_i^r < \sum_{i=1}^{d} y_i^r$
then for $n$ sufficiently large (4) holds. Finally, given $r_3$ and $r_4$ with $1 < r_3 < r_4$, it can be
shown that if the $f_r$ inequality holds for all $r \in [r_3, r_4]$, then there exists an $n_4$ such that if
$n \ge n_4$ then (4) holds for all $r \in [r_3, r_4]$. Note that only this last result is needed to complete
the region 4 step; the sequence of results is given to provide some intuition as to why the
last result holds, as well as to suggest how to establish it.

After all of the region steps are completed, it follows quickly that condition (i) of Step 1
is satisfied by $z_+(s) = e^{-s^{1/n}}$ for sufficiently large $n$. We now explain how a $z_+$ can be
constructed that also satisfies condition (ii). We start with a straightforward result that
shows that whether (2) holds for a given $t$ (or range of $t$) depends only on a portion of the
function $z_+$.

Claim. In the context of Step 1, let $z_+^{(0)}$ and $z_+^{(1)}$ be continuous, positive, decreasing functions
on $[0, \infty)$ with $z_+^{(0)}(0) = z_+^{(1)}(0) = 1$.

(i) Suppose $z_+^{(1)}(s) = z_+^{(0)}(s)$ for all $s \in [0, s_0]$ for some $s_0$, and that (2) holds for $z_+^{(0)}$ when
$t \in [y_1 z_+^{(0)}(s_0), y_1)$. Then (2) holds for $z_+^{(1)}$ when $t$ is in this same interval.

(ii) Fix $t$ and let $[s_1, s_2]$ be the interval for which $z_+^{(0)}(s)$ is in $[t/y_1, t/y_d]$. Suppose for a
given $\Delta$ we have $z_+^{(1)}(s + \Delta) = z_+^{(0)}(s)$ for all $s \in [s_1, s_2]$ (where necessarily $s_1 + \Delta \ge 0$),
and that (2) holds for $z_+^{(0)}$ with our specific $t$. Then (2) holds for $z_+^{(1)}$ and $t$.

Sketch of proof. We outline a proof of part (ii); part (i) can be proved by similar means.
Observe that if $c$ is a component of $x$ or $y$, then $c z_+^{(1)}(s) \ge t$ when $s \le s_1 + \Delta$, and $c z_+^{(1)}(s) \le t$
when $s \ge s_2 + \Delta$. We therefore have

\begin{align*}
\sum_{i=1}^{d} \int_0^\infty \left( x_i z_+^{(1)}(s) - t \right)_+ ds
&= \sum_{i=1}^{d} \left( \int_0^{s_1+\Delta} \left( x_i z_+^{(1)}(s) - t \right) ds + \int_{s_1+\Delta}^{s_2+\Delta} \left( x_i z_+^{(1)}(s) - t \right)_+ ds \right) \\
&= -(s_1 + \Delta)\, t\, d + \int_0^{s_1+\Delta} z_+^{(1)}(s)\, ds + \sum_{i=1}^{d} \int_{s_1}^{s_2} \left( x_i z_+^{(0)}(s) - t \right)_+ ds \tag{9}
\end{align*}

and similarly

\[ \sum_{i=1}^{d} \int_0^\infty \left( x_i z_+^{(0)}(s) - t \right)_+ ds = \int_0^{s_1} z_+^{(0)}(s)\, ds - s_1 t\, d + \sum_{i=1}^{d} \int_{s_1}^{s_2} \left( x_i z_+^{(0)}(s) - t \right)_+ ds. \tag{10} \]

We see from (9) and (10) that the difference

\[ \sum_{i=1}^{d} \int_0^\infty \left( x_i z_+^{(1)}(s) - t \right)_+ ds - \sum_{i=1}^{d} \int_0^\infty \left( x_i z_+^{(0)}(s) - t \right)_+ ds \]

does not depend on $x$. The analogous expression with $x$ replaced by $y$ has the same value.
Thus (with our specific $t$), condition (2) holds for $z_+^{(1)}$ if and only if (2) holds for $z_+^{(0)}$. ∎

To produce the desired $z_+$, we start with $\hat{z}_+(s) = e^{-s^{1/n_0}}$, where $n_0$ is chosen to be large
enough that $\hat{z}_+$ satisfies condition (i) of Step 1. Let $z_+(s) = \hat{z}_+(s)$ when $s \in [0, s_0]$, where
$s_0$ is chosen to be large enough that (2) holds for all $t \in [e^{-n_0/r_1}, y_1)$ regardless of how we
specify the rest of $z_+$; the existence of such an $s_0$ is implied by part (i) of the preceding claim.
The remaining $t$ fall in the interval $(0, e^{-n_0/r_1})$ which, with $t$ parameterized as $t = e^{-n_0/r}$,
corresponds to $r \in R_1$. Recall that in this region it was not necessary for $n$ to be large for
(2) to be satisfied by $z_+(s) = e^{-s^{1/n}}$.

The idea is to extend $z_+$ beyond $s_0$ in such a way that $z_+$ gradually changes from the
form $e^{-s^{1/n_0}}$ to the form $e^{-s}$. We outline how this can be accomplished. We extend $z_+$ with
sections from functions of the form $e^{-s^{1/n}}$ (with offsets chosen to make $z_+$ continuous), where
$n$ decreases from section to section and is now no longer necessarily an integer. We make
each such section long enough that the value of $z_+$ decreases by at least a factor of $y_1/y_d$
within the section. Part (ii) of our claim then implies that to verify (2) for a given $t$ we need
only consider the value of $z_+$ in at most two sections. The starting point for doing this is
to show that (4) holds for noninteger $n \ge 1$ for $t$ in region 1. Then it must be checked that
when the decrement to $n$ is small enough, (2) continues to hold for the relevant values of
$t$. Finally, it must be shown that for sufficiently many decrements we can reach a segment
with $n = 1$. The segment with $n = 1$ is used for the remaining $s$, and part (ii) of our claim
is again used to show that (2) holds. At this point $z_+$ can be verified to satisfy all parts
required by Step 1.

Step 2 is largely similar to Step 1. A key idea in Step 2 is to consider functions of the
form $\hat{z}_-(s) = e^{s^{1/n}}$ where $n$ is a positive integer. If the $f_r$ inequalities are satisfied for all
$r \le 0$, then for sufficiently large $n$ condition (i) in Step 2 holds with this $\hat{z}_-$. As in Step 1,
our strategy for showing this involves five regions of values of $r$, in this case with regions 1',
3', and 5' corresponding conceptually to neighborhoods of $0$, $-1$, and $-\infty$. A $z_-$ satisfying
condition (ii) of Step 2 is constructed analogously to the construction of the $z_+$ satisfying
condition (ii) of Step 1.
IX. STEP 3

The function $z_*$ in this step may be regarded as a continuous precursor to a
finite-dimensional catalyst vector $z$. The construction of $z_*$ is roughly as follows. The function $z_+$
from Step 1 is used for the left side of the interval on which $z_*$ is defined, and the function
$z_-$ from Step 2 is reflected horizontally, scaled appropriately, and used for the right side of
the interval on which $z_*$ is defined. There is a region in the center, in which $z_*$ is equal to a
decaying exponential function, where the two halves coincide. The center region allows the
transition from the submajorization-like condition involving $z_+$ to the supermajorization-like
condition involving $z_-$ to be made.

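We proceed with the details. Let $a = s_+ + s_- + \ln(y_1/y_d)$. The function $z_*$ is defined on
the interval $[0, a]$. We let

$$
z_*(s) =
\begin{cases}
z_+(s) & \text{if } s \in [0, a - s_-]; \\[4pt]
\dfrac{y_d\, z_+(s_+)}{y_1\, z_-(s_-)}\, z_-(a - s) & \text{if } s \in (a - s_-, a].
\end{cases}
\tag{11}
$$

Observe that if $s \in [s_+, a - s_-]$ then

$$
\begin{aligned}
\frac{y_d\, z_+(s_+)}{y_1\, z_-(s_-)}\, z_-(a - s)
&= \frac{y_d\, z_+(s_+)}{y_1\, z_-(s_-)}\, z_-(s_-)\, e^{a - s - s_-} \\
&= \frac{y_d}{y_1}\, z_+(s_+)\, e^{s_+ + \ln(y_1/y_d) - s} \\
&= z_+(s_+)\, e^{-(s - s_+)} \\
&= z_+(s),
\end{aligned}
$$

and therefore when $s \in [s_+, a - s_-]$, either part of (11) can be used. This is the aforementioned
region where the two parts of $z_*(s)$ coincide. Also observe that if $s \in [s_+, a]$ then
$z_*(s) = z_*(a)\, z_-(a - s)$. Note that $z_*$ is continuous and decreasing and satisfies a Lipschitz
condition.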
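
The two-part definition (11) can be explored numerically. The following Python sketch is an illustration only, under stated assumptions: `z_plus` and `z_minus` are hypothetical toy stand-ins (pure exponentials, the simplest functions with the exponential tails used above) rather than the functions produced by Steps 1 and 2, and the names `make_z_star`, `s_plus`, and `s_minus` are ours.

```python
import math

# Hypothetical toy stand-ins, NOT the functions constructed in Steps 1 and 2.
def z_plus(s):
    # Decreasing; satisfies z_plus(s) = z_plus(s_plus) * exp(-(s - s_plus)).
    return math.exp(-s)

def z_minus(u):
    # Increasing with z_minus(0) = 1; satisfies
    # z_minus(u) = z_minus(s_minus) * exp(u - s_minus).
    return math.exp(u)

def make_z_star(y1, yd, s_plus, s_minus):
    """Return (a, left, right, z_star) for the two-part definition (11)."""
    a = s_plus + s_minus + math.log(y1 / yd)
    scale = yd * z_plus(s_plus) / (y1 * z_minus(s_minus))

    def left(s):                 # first part of (11)
        return z_plus(s)

    def right(s):                # second part of (11)
        return scale * z_minus(a - s)

    def z_star(s):
        if not 0.0 <= s <= a:
            raise ValueError("z_star is defined only on [0, a]")
        return left(s) if s <= a - s_minus else right(s)

    return a, left, right, z_star

# Illustrative parameter values (assumptions, not taken from the paper).
y1, yd, s_plus, s_minus = 0.5, 0.1, 1.0, 2.0
a, left, right, z_star = make_z_star(y1, yd, s_plus, s_minus)

# On [s_plus, a - s_minus] the two parts of (11) should coincide.
grid = [s_plus + k * (a - s_minus - s_plus) / 50 for k in range(51)]
max_gap = max(abs(left(s) - right(s)) for s in grid)
```

With these toy choices the two parts agree up to rounding on the central region, which is the property the construction relies on.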

Let $t_* = y_1 z_*(a - s_-)$. Observe that also

$$
t_* = y_1 z_+(a - s_-) = y_1 z_+(s_+)\, e^{-(a - s_- - s_+)} = y_1 z_+(s_+)\, e^{-\ln(y_1/y_d)} = y_d\, z_+(s_+).
$$

Suppose $c$ is a component of $x$ or $y$, so that in particular $0 \le c \le y_1$. If $s \in [a - s_-, a]$
and $t \ge t_*$ then

$$
c\, z_*(s) - t \le y_1 z_*(a - s_-) - t_* = 0,
$$

and similarly if $s \ge a - s_-$ and $t \ge t_*$ then $c\, z_+(s) - t \le 0$. Since $z_*(s) = z_+(s)$ when
$s \in [0, a - s_-]$, for all $t \ge t_*$ we have

$$
\int_0^a (c\, z_*(s) - t)^+ \, ds = \int_0^\infty (c\, z_+(s) - t)^+ \, ds.
$$

Now the result from Step 1 implies that if $t \in [t_*, y_1)$ then

$$
\sum_{i=1}^d \int_0^a (x_i z_*(s) - t)^+ \, ds < \sum_{i=1}^d \int_0^a (y_i z_*(s) - t)^+ \, ds.
\tag{12}
$$

Next we find an analogous result that uses the result from Step 2. The derivation is
slightly more involved because $z_-$ is reflected and scaled to form the right half of
$z_*$. Suppose again that $c$ is a component of $x$ or $y$, so that in particular $c \ge y_d$. If $s \in [0, s_+]$
and $t \le t_*$ then

$$
t - c\, z_*(s) \le t_* - y_d\, z_*(s_+) = 0.
$$

Thus we have

$$
\begin{aligned}
\int_0^a (t - c\, z_*(s))^+ \, ds
&= \int_{s_+}^a (t - c\, z_*(s))^+ \, ds \\
&= \int_{s_+}^a \bigl(t - c\, z_*(a)\, z_-(a - s)\bigr)^+ \, ds \\
&= z_*(a) \int_{s_+}^a \left( \frac{t}{z_*(a)} - c\, z_-(a - s) \right)^+ ds \\
&= z_*(a) \int_0^{a - s_+} \left( \frac{t}{z_*(a)} - c\, z_-(u) \right)^+ du,
\end{aligned}
$$

where we have made the substitution $u = a - s$.

For $u \ge a - s_+$ and $t \le t_*$ we have

$$
\frac{t}{z_*(a)} - c\, z_-(u) \le \frac{t_*}{z_*(a)} - y_d\, z_-(a - s_+) = \frac{y_d\, z_+(s_+)}{z_*(a)} - y_d\, \frac{z_*(s_+)}{z_*(a)} = 0.
$$

Therefore

$$
z_*(a) \int_0^{a - s_+} \left( \frac{t}{z_*(a)} - c\, z_-(u) \right)^+ du = z_*(a) \int_0^{\infty} \left( \frac{t}{z_*(a)} - c\, z_-(u) \right)^+ du,
$$

and so

$$
\int_0^a (t - c\, z_*(s))^+ \, ds = z_*(a) \int_0^{\infty} \left( \frac{t}{z_*(a)} - c\, z_-(u) \right)^+ du.
$$

Thus the result from Step 2 implies that if $t \in (y_d z_*(a), t_*]$ then



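$$
\sum_{i=1}^d \int_0^a (t - x_i z_*(s))^+ \, ds < \sum_{i=1}^d \int_0^a (t - y_i z_*(s))^+ \, ds.
\tag{13}
$$

Finally we show that for any given $t$, conditions (12) and (13) are equivalent. Observe
that $c = (c)^+ - (-c)^+$ for any real $c$; thus

$$
\sum_{i=1}^d \int_0^a (x_i z_*(s) - t)^+ \, ds - \sum_{i=1}^d \int_0^a (t - x_i z_*(s))^+ \, ds
= \sum_{i=1}^d \int_0^a (x_i z_*(s) - t) \, ds
= \int_0^a z_*(s) \, ds - a d t.
$$

Subtracting this equation from the analogous equation for $y$ yields

$$
\sum_{i=1}^d \int_0^a (y_i z_*(s) - t)^+ \, ds - \sum_{i=1}^d \int_0^a (x_i z_*(s) - t)^+ \, ds
= \sum_{i=1}^d \int_0^a (t - y_i z_*(s))^+ \, ds - \sum_{i=1}^d \int_0^a (t - x_i z_*(s))^+ \, ds,
$$

from which the equivalence of (12) and (13) is clear. Thus in particular (12) holds for all
$t \in (y_d z_*(a), y_1)$, so Step 3 is complete.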
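
The equivalence of (12) and (13) rests only on the identity $c = (c)^+ - (-c)^+$ and on $x$ and $y$ having equal sums, so it can be checked numerically for any integrable function. The sketch below is a hedged illustration: the vectors, the function `z`, the interval length `a`, and the helper names `gap_12` and `gap_13` are all assumptions made for the example, and the midpoint rule is used only as a convenient quadrature.

```python
import math

def pos(c):
    """Positive part (c)^+."""
    return c if c > 0.0 else 0.0

def integrate(f, a, steps=2000):
    """Midpoint-rule approximation of the integral of f over [0, a]."""
    h = a / steps
    return h * sum(f((k + 0.5) * h) for k in range(steps))

def gap_12(x, y, z, a, t):
    """Right side minus left side of (12)."""
    return integrate(lambda s: sum(pos(yi * z(s) - t) for yi in y)
                               - sum(pos(xi * z(s) - t) for xi in x), a)

def gap_13(x, y, z, a, t):
    """Right side minus left side of (13)."""
    return integrate(lambda s: sum(pos(t - yi * z(s)) for yi in y)
                               - sum(pos(t - xi * z(s)) for xi in x), a)

# Hypothetical inputs chosen only for illustration; x and y have equal sums.
x = [0.4, 0.4, 0.1, 0.1]
y = [0.5, 0.25, 0.25, 0.0]
z = lambda s: math.exp(-s)   # stand-in for z_*, not the function built in Step 3
a = 3.0

diffs = [abs(gap_12(x, y, z, a, t) - gap_13(x, y, z, a, t))
         for t in (0.01, 0.05, 0.2)]
```

The two gaps agree up to rounding for every tested $t$, so (12) holds at a given $t$ exactly when (13) does.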

X. STEP 4 

Step 3 has provided a function $z_*$ that can be regarded as a continuous catalyst, as the
condition (3) is roughly analogous to the condition (10) for a finite-dimensional vector $z$. It
is fairly straightforward to produce a suitable catalyst vector $z$ from $z_*$. We provide the
details in this section.

From the result of Step 3 we have that (3) holds for all $t \in (y_d z_*(a), y_1)$; therefore
in particular (3) holds for all $t \in [x_d z_*(a), x_1]$. This latter interval is compact, and
$\sum_{i=1}^d \int_0^a (x_i z_*(s) - t)^+ \, ds$ and $\sum_{i=1}^d \int_0^a (y_i z_*(s) - t)^+ \, ds$ are continuous functions of $t$, so we
may pick $\delta > 0$ so that

$$
\sum_{i=1}^d \int_0^a (y_i z_*(s) - t)^+ \, ds - \sum_{i=1}^d \int_0^a (x_i z_*(s) - t)^+ \, ds \ge \delta
\tag{14}
$$

when $t \in [x_d z_*(a), x_1]$.

Now let $\ell$ be an integer that is large enough to imply that $|z_*(s_1) - z_*(s_2)| < \delta/(2 y_1 a d)$
whenever $|s_1 - s_2| \le a/\ell$; such an $\ell$ must exist from the Lipschitz condition on $z_*(s)$. This $\ell$
will be the dimension of our catalyst $z$. For each $j \in \{1, \ldots, \ell\}$ pick $z_j$ as any value in the
interval $[z_*((j/\ell)a),\, z_*(((j-1)/\ell)a)]$. Our catalyst $z$ is the vector $(z_1, \ldots, z_\ell)$.
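
The discretization just described can be sketched in a few lines. This is a minimal illustration, assuming a toy continuous catalyst `z_star` (a pure exponential, not the function built in Step 3) and the particular choice $z_j = z_*((j/\ell)a)$, which is one admissible value in the stated interval; the helper names are ours. It compares the scaled sum $(a/\ell)\sum_j (c z_j - t)^+$ with the integral $\int_0^a (c z_*(s) - t)^+ ds$ for a single component $c$.

```python
import math

def pos(c):
    """Positive part (c)^+."""
    return c if c > 0.0 else 0.0

def discretize(z_star, a, ell):
    """Pick z_j = z_star(j*a/ell), the lower endpoint of the allowed interval."""
    return [z_star(j * a / ell) for j in range(1, ell + 1)]

def scaled_sum(c, z, a, t):
    """(a/ell) * sum_j (c*z_j - t)^+ for a single component c."""
    return (a / len(z)) * sum(pos(c * zj - t) for zj in z)

def integral(c, z_star, a, t, steps=20000):
    """Midpoint-rule approximation of the integral of (c*z_star(s) - t)^+ on [0, a]."""
    h = a / steps
    return h * sum(pos(c * z_star((k + 0.5) * h) - t) for k in range(steps))

# Hypothetical toy continuous catalyst and parameters (assumptions for illustration).
z_star = lambda s: math.exp(-s)
a, c, t = 3.0, 0.5, 0.1

errors = {}
for ell in (10, 100, 1000):
    z = discretize(z_star, a, ell)
    errors[ell] = abs(scaled_sum(c, z, a, t) - integral(c, z_star, a, t))

# Crude bound: c * a * (largest drop of z_star over one cell of width a/ell).
bound_10 = c * a * (1.0 - math.exp(-a / 10))
```

In this toy setting the discrepancy shrinks as $\ell$ grows and stays below the crude bound, which is the mechanism that lets a sufficiently fine discretization inherit the strict inequality from the continuous catalyst.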





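We now verify that $x \otimes z \prec y \otimes z$. Since the sum of all components of $x \otimes z$ is equal to
the sum of all components of $y \otimes z$, we need only verify that for all $t$,

$$
\sum_{i=1}^d \sum_{j=1}^{\ell} (y_i z_j - t)^+ - \sum_{i=1}^d \sum_{j=1}^{\ell} (x_i z_j - t)^+ \ge 0.
\tag{15}
$$

For $t \ge x_1$, the second sum is zero, so (15) must hold. For $t \le x_d z_*(a)$ we have

$$
\sum_{i=1}^d \sum_{j=1}^{\ell} (y_i z_j - t)^+ \ge \sum_{i=1}^d \sum_{j=1}^{\ell} (y_i z_j - t) = \sum_{i=1}^d \sum_{j=1}^{\ell} (x_i z_j - t) = \sum_{i=1}^d \sum_{j=1}^{\ell} (x_i z_j - t)^+,
$$

so again (15) holds.

For the remaining values of $t$, namely $t \in (x_d z_*(a), x_1)$, we show that the (appropriately
scaled) values of the double sums in (15) are close to the values of the corresponding sum-integrals,
then we use (14) to establish (15). We have

$$
\begin{aligned}
\left| \frac{a}{\ell} \sum_{i=1}^d \sum_{j=1}^{\ell} (x_i z_j - t)^+ - \sum_{i=1}^d \int_0^a (x_i z_*(s) - t)^+ \, ds \right|
&= \left| \sum_{i=1}^d \sum_{j=1}^{\ell} \int_{\frac{j-1}{\ell}a}^{\frac{j}{\ell}a} \Bigl( (x_i z_j - t)^+ - (x_i z_*(s) - t)^+ \Bigr) ds \right| \\
&\le \sum_{i=1}^d \sum_{j=1}^{\ell} \int_{\frac{j-1}{\ell}a}^{\frac{j}{\ell}a} \Bigl| (x_i z_j - t)^+ - (x_i z_*(s) - t)^+ \Bigr| \, ds \\
&\le \sum_{i=1}^d \sum_{j=1}^{\ell} \int_{\frac{j-1}{\ell}a}^{\frac{j}{\ell}a} \bigl| x_i z_j - x_i z_*(s) \bigr| \, ds \\
&\le \sum_{i=1}^d \sum_{j=1}^{\ell} \int_{\frac{j-1}{\ell}a}^{\frac{j}{\ell}a} \frac{\delta}{2ad} \, ds \\
&= \delta/2.
\end{aligned}
$$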



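Similarly,

$$
\left| \frac{a}{\ell} \sum_{i=1}^d \sum_{j=1}^{\ell} (y_i z_j - t)^+ - \sum_{i=1}^d \int_0^a (y_i z_*(s) - t)^+ \, ds \right| \le \delta/2.
$$

Thus

$$
\frac{a}{\ell} \left( \sum_{i=1}^d \sum_{j=1}^{\ell} (y_i z_j - t)^+ - \sum_{i=1}^d \sum_{j=1}^{\ell} (x_i z_j - t)^+ \right)
\ge -\delta + \sum_{i=1}^d \int_0^a (y_i z_*(s) - t)^+ \, ds - \sum_{i=1}^d \int_0^a (x_i z_*(s) - t)^+ \, ds
\ge 0,
$$

so (15) holds.

Thus we have shown that (15) holds for all $t$, so $x \otimes z \prec y \otimes z$, as desired.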
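
The conclusion $x \otimes z \prec y \otimes z$ is easy to test directly for a given finite catalyst. The following sketch checks majorization by sorted partial sums; it illustrates only the final relation, not the construction of $z$ from $z_*$. The vectors used are a small example often quoted in the literature on entanglement catalysis, chosen here as an assumption for illustration and not taken from the text above; the function names are ours.

```python
from itertools import accumulate

def majorized(x, y, tol=1e-12):
    """Return True if x is majorized by y (same length, equal sums)."""
    xs, ys = sorted(x, reverse=True), sorted(y, reverse=True)
    if len(xs) != len(ys) or abs(sum(xs) - sum(ys)) > tol:
        return False
    # Compare the partial sums of the k largest components for every k.
    return all(px <= py + tol for px, py in zip(accumulate(xs), accumulate(ys)))

def tensor(x, z):
    """All products x_i * z_j; the order is irrelevant for majorization."""
    return [xi * zj for xi in x for zj in z]

x = [0.4, 0.4, 0.1, 0.1]
y = [0.5, 0.25, 0.25, 0.0]
z = [0.6, 0.4]                      # candidate catalyst

direct = majorized(x, y)                            # x is not majorized by y
catalyzed = majorized(tensor(x, z), tensor(y, z))   # but x (x) z is majorized by y (x) z
```

Here `direct` is false because the two largest components of $x$ sum to $0.8 > 0.75$, while `catalyzed` is true, so this pair satisfies the catalytic relation with a two-dimensional catalyst.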
XI. DISCUSSION 

A proof of Theorem 2 that follows along the lines outlined here appears to be constructive
enough that one could produce a crude upper bound on the minimum dimension of the
catalyst, as a function of $x$ and $y$. It may take a substantial effort to do this. It may be of
interest to determine an asymptotic growth rate of the minimum dimension of the catalyst
as, say, $x$ approaches the boundary of $T(y)$.

The characterization of catalytic majorization provided by Theorem 2 requires verifying
a continuum of inequalities before concluding that $x \prec_T y$. In practice, for a particular
$x$ and $y$, one can generally use the properties of the $f_r$ to check all $f_r$ inequalities with a
finite amount of work. Given this, one might speculate that some of the inequalities are
redundant, but we think that this is not the case: we conjecture that for any $r_0 \in \mathbb{R}$, there
exist $x$ and $y$ such that $f_r(x) < f_r(y)$ holds for all $r$ except $r = r_0$ (note that this would
imply that $f_{r_0}(x) = f_{r_0}(y)$).

The methods used here suggest several possible additional mathematical investigations. 
We mention one direction that seems intriguing. The idea is that it may be fruitful to define 
weak catalytic majorization. As background, we note that the usual weak majorizations 
(submajorization and supermajorization) can be naturally defined for infinite vectors x and 
y as follows: 

An infinite vector $x = (x_1, x_2, \ldots)$ can be defined to be submajorized by an infinite vector
$y$ if for all $k \ge 1$ we have $\sum_{i=1}^k x_i^{\downarrow} \le \sum_{i=1}^k y_i^{\downarrow}$. This definition makes sense only if for any
component of $x$, there are a finite number of larger components of $x$, and similarly for $y$.
If the components of $x$ and $y$ are nonnegative and have finite sums, then this definition is
equivalent to the order-free formulation that requires $\sum_{i=1}^{\infty} (x_i - t)^+ \le \sum_{i=1}^{\infty} (y_i - t)^+$ for
all $t > 0$. Similarly, $x$ can be defined to be supermajorized by $y$ if for all $k \ge 1$ we have
$\sum_{i=1}^k x_i^{\uparrow} \ge \sum_{i=1}^k y_i^{\uparrow}$, where $x^{\uparrow}$ denotes the components of $x$ arranged in increasing order. This definition makes sense only if for any component of $x$, there are
a finite number of smaller components of $x$, and similarly for $y$. If the components of $x$
and $y$ are nonnegative and unbounded, then this definition is equivalent to the order-free
formulation that requires $\sum_{i=1}^{\infty} (t - x_i)^+ \le \sum_{i=1}^{\infty} (t - y_i)^+$ for all $t > 0$.
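
For finite vectors of equal length the partial-sum and order-free formulations can be compared directly. The sketch below is a hedged illustration restricted to that finite case (the infinite case discussed above is not modeled); the function names and test pairs are assumptions of the example. Because both sides of each order-free inequality are piecewise linear in $t$ with breakpoints at the components, it suffices to test $t$ at the components (and at $t = 0$ for submajorization).

```python
def pos(c):
    """Positive part (c)^+."""
    return c if c > 0.0 else 0.0

def submajorized(x, y, tol=1e-12):
    """Partial sums: the k largest components of x sum to at most those of y."""
    sx = sy = 0.0
    for xi, yi in zip(sorted(x, reverse=True), sorted(y, reverse=True)):
        sx, sy = sx + xi, sy + yi
        if sx > sy + tol:
            return False
    return True

def supermajorized(x, y, tol=1e-12):
    """Partial sums: the k smallest components of x sum to at least those of y."""
    sx = sy = 0.0
    for xi, yi in zip(sorted(x), sorted(y)):
        sx, sy = sx + xi, sy + yi
        if sx < sy - tol:
            return False
    return True

def submajorized_order_free(x, y, tol=1e-12):
    """sum_i (x_i - t)^+ <= sum_i (y_i - t)^+, tested at the breakpoints."""
    ts = {0.0, *x, *y}
    return all(sum(pos(xi - t) for xi in x) <= sum(pos(yi - t) for yi in y) + tol
               for t in ts)

def supermajorized_order_free(x, y, tol=1e-12):
    """sum_i (t - x_i)^+ <= sum_i (t - y_i)^+, tested at the breakpoints."""
    ts = {*x, *y}
    return all(sum(pos(t - xi) for xi in x) <= sum(pos(t - yi) for yi in y) + tol
               for t in ts)

# Illustrative nonnegative test pairs (assumptions, not from the paper).
pairs = [([0.3, 0.2], [0.4, 0.3]),
         ([0.5, 0.1], [0.4, 0.4]),
         ([0.4, 0.4, 0.1, 0.1], [0.5, 0.25, 0.25, 0.0]),
         ([0.2, 0.2], [0.3, 0.0]),
         ([0.5, 0.5], [0.7, 0.3])]
agree = all(submajorized(x, y) == submajorized_order_free(x, y) and
            supermajorized(x, y) == supermajorized_order_free(x, y)
            for x, y in pairs)
```

On these pairs the two formulations give the same verdicts, and the last pair, where $x \prec y$, is both submajorized and supermajorized.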

With these notions in place, we can state a possibility for weak catalytic majorization.
Suppose $x$ and $y$ are $d$-dimensional vectors with nonnegative components. It is not assumed
that the sum of the components of $x$ equals the sum of the components of $y$. We consider
infinite catalyst vectors $z = (z_1, z_2, \ldots)$ with nonnegative components (and at least one
nonzero component). A possible definition is that $x$ is catalytically submajorized by $y$ if
there exists a catalyst $z$ such that $x \otimes z$ is submajorized by $y \otimes z$ and $\sum_{j=1}^{\infty} z_j^r$ is finite for
all $r > 0$. Also $x$ is catalytically supermajorized by $y$ if there exists a catalyst $z$ such that
$x \otimes z$ is supermajorized by $y \otimes z$ and $\sum_{j=1}^{\infty} z_j^r$ is finite for all $r < 0$.

We think that if the condition on $z$ is chosen properly in both cases (we are not sure if
we have it right as written), then $x$ being both catalytically submajorized and catalytically
supermajorized by $y$ will imply $x \prec_T y$.

Note that with these definitions it is possible that, say, $x$ is catalytically submajorized
by $y$ and the sum of the components of $x$ is equal to the sum of the components of $y$, but
$x \nprec_T y$.

On a related note, we ask: if for some $r_0 > 0$ we only require the sum $\sum_{j=1}^{\infty} z_j^r$ to be finite
when $r > r_0$, then are the $f_r$ inequalities for $r > r_0$ sufficient to imply the existence of a $z$
for which $x \otimes z$ is submajorized by $y \otimes z$? Analogously, if for some $r_0 < 0$ we only require
the sum $\sum_{j=1}^{\infty} z_j^r$ to be finite when $r < r_0$, then are the $f_r$ inequalities for $r < r_0$ sufficient
to imply the existence of a $z$ for which $x \otimes z$ is supermajorized by $y \otimes z$?

APPENDIX 

Here we prove a claim that was used in Section IVHI to show that in our sufficiency proof 
we need only consider those y whose components are all nonzero. We restate the claim here. 

Claim. Suppose $x$ and $y$ are $d$-dimensional probability vectors with components in decreasing
order, and such that $x_1 < y_1$ and $x_d > y_d$ and for all $r \in \mathbb{R}$ the inequality $f_r(x) < f_r(y)$ holds.
Then there exists a $d$-dimensional probability vector $y'$ whose components are all nonzero,
with $x_1 < y'_1$ and $x_d > y'_d$ (we assume the components of $y'$ are in decreasing order), such
that $f_r(x) < f_r(y')$ for all $r \in \mathbb{R}$, and such that $y' \prec y$.

Proof. If all components of $y$ are nonzero, we can simply take $y' = y$, and we are done.
Therefore we suppose $k$ components of $y$ are 0, where $k \ge 1$. Let $w$ be the $d$-dimensional
vector for which the first $d - k$ components are 0, and the remaining $k$ components are 1. For
$n \ge 1$ let $y^{(n)} = \frac{n-1}{n}\, y + \frac{1}{nk}\, w$. Clearly, for sufficiently large $n$, say $n \ge N_0$, the components
of $y^{(n)}$ are in decreasing order and $y^{(n)} \prec y$. Note also that $y^{(n)} \prec y^{(n+1)}$ for $n \ge N_0$.
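
The perturbation $y^{(n)}$ and the two majorization relations just stated can be checked exactly on a small example. The sketch below is an illustration under assumptions: it takes $y^{(n)} = \frac{n-1}{n} y + \frac{1}{nk} w$ with $w$ supported on the zero components of $y$, uses exact rational arithmetic to avoid rounding questions, and the names `perturb` and `majorized` and the sample vector are ours.

```python
from fractions import Fraction
from itertools import accumulate

def majorized(x, y):
    """Exact test that x is majorized by y (same length, equal sums)."""
    xs, ys = sorted(x, reverse=True), sorted(y, reverse=True)
    if len(xs) != len(ys) or sum(xs) != sum(ys):
        return False
    return all(px <= py for px, py in zip(accumulate(xs), accumulate(ys)))

def perturb(y, n):
    """Return y^(n): scale y by (n-1)/n and put 1/(n*k) on each zero component."""
    k = sum(1 for v in y if v == 0)
    if k == 0:
        raise ValueError("y has no zero components; take y' = y instead")
    return [Fraction(n - 1, n) * v if v != 0 else Fraction(1, n * k) for v in y]

# Sample probability vector in decreasing order with one zero component (assumption).
y = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4), Fraction(0)]
y10, y11 = perturb(y, 10), perturb(y, 11)

is_probability = sum(y10) == 1 and all(v > 0 for v in y10)
chain_holds = majorized(y10, y11) and majorized(y11, y)
```

For this sample, $y^{(10)} = (9/20, 9/40, 9/40, 1/10)$ is a probability vector with all components nonzero, and the chain $y^{(10)} \prec y^{(11)} \prec y$ holds.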

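Choose $N_1$ to be large enough that $1/(N_1 k) < x_d$. Choose $r_1 < 0$ so that if $r < r_1$ then

$$
\left( \frac{1}{N_1 k} \right)^{r} > x_d^{\,r}\, d.
$$

Choose $N_2$ to be large enough that $\frac{N_2 - 1}{N_2}\, y_1 > x_1$. Choose $r_2 > 1$ so that if $r > r_2$ then

$$
\left( \frac{N_2 - 1}{N_2}\, y_1 \right)^{r} > x_1^{\,r}\, d.
$$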



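For general $x$, $y$, and $r$ we define

$$
F(x, y, r) =
\begin{cases}
\dfrac{1}{r(r-1)} \left( \ln \sum_{i=1}^d y_i^r - \ln \sum_{i=1}^d x_i^r \right), & \text{if } r \ne 0, 1; \\[8pt]
-\dfrac{1}{d} \left( \sum_{i=1}^d \ln y_i - \sum_{i=1}^d \ln x_i \right), & \text{if } r = 0; \\[8pt]
\sum_{i=1}^d y_i \ln y_i - \sum_{i=1}^d x_i \ln x_i, & \text{if } r = 1.
\end{cases}
$$

If all components of $x$ are nonzero then $F(x, y, r) > 0$ is equivalent to $f_r(x) < f_r(y)$. It is
straightforward to verify that for fixed $x$ and $y$, both with all components nonzero, $F(x, y, r)$
is continuous in $r$ over all of $\mathbb{R}$. For fixed $x$ and $r$ the function $F(x, y, r)$ is Schur-convex in
$y$.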
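
The continuity of $F$ in $r$ at the two special points can be probed numerically. The sketch below is a hedged illustration that assumes the normalization $1/(r(r-1))$ for $r \ne 0, 1$, which is the choice under which the three cases fit together continuously; the sample vectors (with $x \prec y$ and all components nonzero) are assumptions made for the example.

```python
import math

def F(x, y, r):
    """F(x, y, r) for probability vectors with all components nonzero.

    Assumes the prefactor 1/(r(r-1)) for r not in {0, 1}.
    """
    d = len(x)
    if r == 0:
        return -(sum(math.log(v) for v in y) - sum(math.log(v) for v in x)) / d
    if r == 1:
        return sum(v * math.log(v) for v in y) - sum(v * math.log(v) for v in x)
    num = math.log(sum(v ** r for v in y)) - math.log(sum(v ** r for v in x))
    return num / (r * (r - 1))

# Sample vectors with x majorized by y (an assumption for illustration).
x = [0.4, 0.3, 0.3]
y = [0.5, 0.3, 0.2]

values = {r: F(x, y, r) for r in (-2.0, -1.0, 0.0, 0.5, 1.0, 2.0, 5.0)}
jump_at_0 = abs(F(x, y, 1e-6) - F(x, y, 0.0))
jump_at_1 = abs(F(x, y, 1.0 + 1e-6) - F(x, y, 1.0))
```

For this pair every sampled value of $F$ is positive, consistent with Schur-convexity in $y$, and the values just beside $r = 0$ and $r = 1$ match the special-case formulas to within the step size.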

For our specific $x$ and $y$ (with $y^{(n)}$ constructed from $y$) let $g_n(r) = F(x, y^{(n)}, r)$. We
then have that for all $n \ge N_0$ the function $g_n(r)$ is continuous in $r$. Also if $n \ge N_0$ then
$y^{(n)} \prec y^{(n+1)}$ and so $g_n(r) \le g_{n+1}(r)$ for all $r \in \mathbb{R}$. Finally, for all $r \in \mathbb{R}$ it can be verified
that $\lim_{n \to \infty} g_n(r) > 0$ (specifically, the limit is $F(x, y, r)$, and in particular it is $+\infty$ when
$r \le 0$).

From these properties of $g_n(r)$, a subclaim that we state below implies that there exists
an $N$ such that for all $r \in [r_1, r_2]$ we have $g_n(r) > 0$ when $n \ge N$.

Let $y' = y^{(n)}$ for $n = \max(N_0, N_1, N_2, N)$. Now if $r \in [r_1, r_2]$ then $g_n(r) > 0$ so that
$f_r(x) < f_r(y')$. If $r < r_1$ then we have

$$
\sum_{i=1}^d (y'_i)^r \ge (y'_d)^r = \left( \frac{1}{nk} \right)^{r} \ge \left( \frac{1}{N_1 k} \right)^{r} > x_d^{\,r}\, d \ge \sum_{i=1}^d x_i^r,
$$

which implies $f_r(x) < f_r(y')$. If $r > r_2$ then we have

$$
\sum_{i=1}^d (y'_i)^r \ge (y'_1)^r = \left( \frac{n-1}{n}\, y_1 \right)^{r} \ge \left( \frac{N_2 - 1}{N_2}\, y_1 \right)^{r} > x_1^{\,r}\, d \ge \sum_{i=1}^d x_i^r,
$$

and again we have $f_r(x) < f_r(y')$. Thus $f_r(x) < f_r(y')$ for all $r \in \mathbb{R}$, so the proof is complete
except for establishing the subclaim.

Subclaim. Suppose $[a, b]$ is an arbitrary closed interval, and $g_n(r)$ satisfies the following:

(i) for all $n \ge N_0$ the function $g_n(r)$ is continuous in $r$ over all of $\mathbb{R}$;

(ii) $g_n(r)$ is increasing in $n$ when $n \ge N_0$ and $r \in \mathbb{R}$; and

(iii) $\lim_{n \to \infty} g_n(r) > 0$ for all $r \in \mathbb{R}$.

Then there exists an $N$ such that $g_n(r) > 0$ for all $r \in [a, b]$ and $n \ge N$.

Proof of subclaim. For all $n \ge N_0$ let $E_n = \{\, r \mid g_n(r) > 0 \,\}$. Then from (i) each $E_n$ is an
open set, and it follows from (ii) that $E_n \subseteq E_{n+1}$. From (iii) we have $\bigcup_{n \ge N_0} E_n = \mathbb{R}$, so in
particular $[a, b] \subseteq \bigcup_{n \ge N_0} E_n$.

Because the interval $[a, b]$ is compact, there exists a finite subset of $\{E_n\}_{n \ge N_0}$ that covers
$[a, b]$. In view of the fact that $E_n \subseteq E_{n+1}$, there is thus an $N$ for which $[a, b] \subseteq E_N$, which
implies that $[a, b] \subseteq E_n$ for $n \ge N$. Thus the claim holds using this choice of $N$. ∎

Remarks. Our subclaim appears to be a simpler relative of Dini's theorem, and our proof
of the subclaim is similar to a standard proof of Dini's theorem. The fact that when $n$ is
large enough $y^{(n)}$ has the desired properties can also be proved without the subclaim using
the definition of the $f_r$ directly.